This repository was archived by the owner on Jul 4, 2025. It is now read-only.

Conversation

vansangpfiev (Contributor) commented Apr 12, 2024

For embedding-only models like nomic-embed-text-v1.5.f16.gguf, we skip the warm-up step. We also change the API to send_embedding for non-logits models.

```cpp
} else {
    // Non-logits model: read the pooled embedding from the context
    const float* data = llama_get_embeddings(ctx);
    std::vector<float> embedding(data, data + n_embd);
    std::vector<float> embd_res(n_embd, 0.0f);
```
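The `embd_res` buffer suggests a normalization step follows this snippet. A minimal sketch of L2-normalizing the raw embedding, assuming the PR normalizes the way llama.cpp's embedding examples typically do (the helper name is illustrative):

```cpp
#include <cmath>
#include <vector>

// Sketch only: L2-normalize a raw embedding into a result buffer.
// Assumption: this mirrors the normalization applied to embd_res.
std::vector<float> l2_normalize(const std::vector<float>& embedding) {
    double sum = 0.0;
    for (float v : embedding) sum += static_cast<double>(v) * v;
    const float norm = sum > 0.0 ? static_cast<float>(std::sqrt(sum)) : 1.0f;
    std::vector<float> embd_res(embedding.size(), 0.0f);
    for (size_t i = 0; i < embedding.size(); ++i) {
        embd_res[i] = embedding[i] / norm;
    }
    return embd_res;
}
```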
vansangpfiev (Contributor, Author):

The API change for non-logits models is here.
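A hypothetical sketch of what a send_embedding-style response could look like for a non-logits model: the server serializes the embedding vector instead of sampled tokens. The function name and JSON shape are illustrative assumptions, not the PR's exact payload:

```cpp
#include <sstream>
#include <string>
#include <vector>

// Sketch only: serialize an embedding vector as a JSON response body.
// The payload shape is an assumption for illustration.
std::string embedding_to_json(const std::vector<float>& embd) {
    std::ostringstream os;
    os << "{\"embedding\":[";
    for (size_t i = 0; i < embd.size(); ++i) {
        if (i) os << ",";
        os << embd[i];
    }
    os << "]}";
    return os.str();
}
```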

@vansangpfiev vansangpfiev changed the title from "emb: test" to "fix: embedding fixup for nomic-embed-text model" Apr 15, 2024
@vansangpfiev vansangpfiev marked this pull request as ready for review April 15, 2024 09:49

```cpp
params.n_gpu_layers = jsonBody->get("ngl", 100).asInt();
params.n_ctx = jsonBody->get("ctx_len", 2048).asInt();
is_embedded_model =
```
vansangpfiev (Contributor, Author):

Use the embedding flag to decide whether the model should be warmed up with an embedding pass or not.
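A minimal sketch of that decision, assuming the flag is parsed from an `embedding` field in the request body (the struct, enum, and helper names here are illustrative, not the PR's actual identifiers):

```cpp
// Sketch only: choose a warm-up strategy from the request's embedding flag.
// Embedding-only models (e.g. nomic-embed-text) expose no logits, so a
// decode-based warm-up is replaced by an embedding pass.
struct LoadParams {
    bool embedding = false;  // e.g. jsonBody->get("embedding", false).asBool()
};

enum class WarmupMode { Decode, Embedding };

WarmupMode pick_warmup(const LoadParams& p) {
    return p.embedding ? WarmupMode::Embedding : WarmupMode::Decode;
}
```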

@tikikun tikikun self-requested a review April 16, 2024 00:57
tikikun (Contributor) reviewed:

LGTM

@tikikun tikikun merged commit 7ae9928 into main Apr 16, 2024
@vansangpfiev vansangpfiev deleted the fix-embed branch July 8, 2024 05:40


3 participants